CDS
Accession Number | TCMCG075C11704 |
gbkey | CDS |
Protein Id | XP_017973215.1 |
Location | complement(join(36043941..36044046,36045041..36046830)) |
Gene | LOC18606908 |
GeneID | 18606908 |
Organism | Theobroma cacao |
Protein
Length | 631aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018117726.1 |
Definition | PREDICTED: probable carotenoid cleavage dioxygenase 4, chloroplastic isoform X1 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
COG_category | Q |
Description | carotenoid cleavage dioxygenase 4 |
KEGG_TC | - |
KEGG_Module |
M00372
[VIEW IN KEGG] |
KEGG_Reaction |
R06952
[VIEW IN KEGG] R06953 [VIEW IN KEGG] R09682 [VIEW IN KEGG] |
KEGG_rclass |
RC00912
[VIEW IN KEGG] RC01690 [VIEW IN KEGG] |
BRITE |
ko00000
[VIEW IN KEGG] ko00001 [VIEW IN KEGG] ko00002 [VIEW IN KEGG] ko01000 [VIEW IN KEGG] |
KEGG_ko |
ko:K09840
[VIEW IN KEGG] |
EC |
1.13.11.51
[VIEW IN KEGG]
[VIEW IN INGREDIENT] |
KEGG_Pathway |
ko00906
[VIEW IN KEGG] ko01100 [VIEW IN KEGG] ko01110 [VIEW IN KEGG] map00906 [VIEW IN KEGG] map01100 [VIEW IN KEGG] map01110 [VIEW IN KEGG] |
GOs | - |
Sequence
CDS: ATGGACGCTTTCTCCTCCTCCTCCTTCTCAAAACTCGCCTCTCCCACCGTGACACTACCCAATTCCAAAACCATTTCGACACACCCGGGACCATCTCATGCTCCTCACCTCAACATTTCCTCTGTTAGAATGGAGAATAAACCTCAAACTTCAACCACTACAACGAGTAAAACAAAGCCCCCAACTTCAAATATCCAAATTCCATCACTAACGGTCTCGTCGTCGATTGGAGCAGAGAAAAAAGCAGAAACGACGGTAACGACAAGGATGTTTGATACATTAAATGACTTTATCAATAACTCAATAGACCCTCCTCTACGCCCCGCATTTGATCCCAGGTTCGTGCTCTCGGGTAACTTTGCTTCTGTTGATGAGCTCCCTCCAACAGATTGTGAGGTGATACAGGGATCCCTCCCGTCATGCCTAGACGGTGCGTACATACGCAATGGCCCCAATCCACAGTACCAACCTCGTGGCCCTTACCACCTCTTTGACGGTGATGGCATGCTTCACTCCCTTAGAATTTCCCAGGGTAAAGCCACTTTGTGCAGCCGTTTTGTCAAGACTTACAAGTATACCACTGAGCAAAGTATCGGCTCTCCGGTTGTTCCCAATTTTTTCTCTAGTTTCAGCAGCATGCCTGCTTGTCTGGCCCGTGGTGCGCTCTATGCAGCTAGAGTTATAATTGGTCATTACAATCCTGCGAAGGGCATTGGTCCTGCAAACACTAGTTTGGCTTTGTTCGGCAACCGTCTCTATGCTCTTGGTGAGTCTGATTTACCTTATGCTGTACGCTTGACGCCCAGTGGTGATATAGAAACATTGGATCGCCATGATTTTGACGGAAAACTGTTGGCCAGCATGACAGCTCACCCCAAGATAGACCCTGACAGTGGGGAGGCCTTTGCTTTCAGATATGGTCTGATACGTCCATTTCTAACTTACTTTAACTTCGACGCAGACGGAAACAAACACTCAGATGTGCCCATACTGTCTATGGCCCGTCCATCTTTCGTCCATGATTTTGCAATTACAAAGAATTATGCCTTATTTCCCGACATACAAATGGAAATAAAGCCCATGAAAATGATTTTAGAAGGAGGTTCTCCTATGATATTAAATCCGGCCAAAGTGCCAAGAATTGGAGTCATCCCTAGGTATGCGAAAAATGATTCAGAAATGAGATGGTTTGATGTGCCAGGGTTCAACCCCGTGCATGTGGTGAATGCTTGGGAAGTGGACGATGGCAATGCGATATTTATGTTAGCACCAAATATAATATCAGTAGAACATGCCCTGGAGAGATCGGACCTTATCCACGGTATGATGGAGAAAGTAAGAGTCGACCTAAAGACAGGGCTGGTAACAAGGCAGCCGATCTCATCAAGCAATCTAGACTTTGCAGTGATAAACCCAGCATACTTAGCTAAAAAGAACAGGTACGTATACTCTGGCGTAGGTGAGCCGCTGCCAAAAATATCAGGAGTAGTGAAGCTGGATGTGTCCAAAGGAGAGTTCCAGGAGTGCACGGTGGGAAGCAGGATGTACGGCCCAGGGTGCTACGGTGGGGAGCCCTTCTTTGTTGCCAGAGCACCGGAGAATCCAGAGGCGGAGGAGGATGATGGGTATTTGTTGACATATGTTCATAATGAAAACACAGGAGAATCAAGATTCTTGGTGATGGATGCAAAGTCACCCAATCTTGACATAGTGGCTACCGTGAAGCTACCCCAACGTATCCCTTACGGCTTCCATGGACTTTTTGTGAAGGAGAGTGAACTCAACAAATTGATTCTAAACAACATCACAGGAGGCATAGTCACACTCAAGAGAAAAACAAAAAAAGGGGTAAAGAATGAAGCCAAGAACAAGTATAAGAAACAATTTATAGCGTAA |
Protein: MDAFSSSSFSKLASPTVTLPNSKTISTHPGPSHAPHLNISSVRMENKPQTSTTTTSKTKPPTSNIQIPSLTVSSSIGAEKKAETTVTTRMFDTLNDFINNSIDPPLRPAFDPRFVLSGNFASVDELPPTDCEVIQGSLPSCLDGAYIRNGPNPQYQPRGPYHLFDGDGMLHSLRISQGKATLCSRFVKTYKYTTEQSIGSPVVPNFFSSFSSMPACLARGALYAARVIIGHYNPAKGIGPANTSLALFGNRLYALGESDLPYAVRLTPSGDIETLDRHDFDGKLLASMTAHPKIDPDSGEAFAFRYGLIRPFLTYFNFDADGNKHSDVPILSMARPSFVHDFAITKNYALFPDIQMEIKPMKMILEGGSPMILNPAKVPRIGVIPRYAKNDSEMRWFDVPGFNPVHVVNAWEVDDGNAIFMLAPNIISVEHALERSDLIHGMMEKVRVDLKTGLVTRQPISSSNLDFAVINPAYLAKKNRYVYSGVGEPLPKISGVVKLDVSKGEFQECTVGSRMYGPGCYGGEPFFVARAPENPEAEEDDGYLLTYVHNENTGESRFLVMDAKSPNLDIVATVKLPQRIPYGFHGLFVKESELNKLILNNITGGIVTLKRKTKKGVKNEAKNKYKKQFIA |